An empirical comparison and characterisation of nine popular clustering methods
نویسندگان
چکیده
Nine popular clustering methods are applied to 42 real data sets. The aim is give a detailed characterisation of the by means several cluster validation indexes that measure various individual aspects resulting clusters such as small within-cluster distances, separation clusters, closeness Gaussian distribution etc. introduced in Hennig (in: Data analysis and applications 1: regression, modeling—estimating, forecasting mining, ISTE Ltd., London, 2019). 30 sets come with “true” clustering. On these similarity clusterings from nine explored. Furthermore, mixed effects regression relates observable clusterings, which problems unobservable. study gives new insight not only into ability discover but also properties can be expected methods, crucial for choice method situation without given
منابع مشابه
on the comparison of keyword and semantic-context methods of learning new vocabulary meaning
the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...
15 صفحه اولPopular Ensemble Methods: An Empirical Study
An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more accurate than any of the single classifiers in the ensemble. Bagging (Breiman, 1996c) and Boosting (Freund & Schapire, 1996; Schapire, 1990) are two relatively new...
متن کاملinvestigation of single-user and multi-user detection methods in mc-cdma systems and comparison of their performances
در این پایان نامه به بررسی روش های آشکارسازی در سیستم های mc-cdma می پردازیم. با توجه به ماهیت آشکارسازی در این سیستم ها، تکنیک های آشکارسازی را می توان به دو دسته ی اصلی تقسیم نمود: آشکارسازی سیگنال ارسالی یک کاربر مطلوب بدون در نظر گرفتن اطلاعاتی در مورد سایر کاربران تداخل کننده که از آن ها به عنوان آشکارساز های تک کاربره یاد می شود و همچنین آشکارسازی سیگنال ارسالی همه ی کاربران فعال موجود در...
An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering
Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...
متن کاملAn empirical comparison of three inference methods
In this paper, an empirical evaluation of three infer ence methods for uncertain reasoning is presented in the context of Pathfinder, a large expert system for the diagnosis of lymph node pathology. The inference procedures evaluated are (1) Bayes' theorem, a:ssum ing evidence is conditionally independent given each hypothesis, (2) odds-likelihood updating, assuming evidence is conditionally ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Advances in data analysis and classification
سال: 2022
ISSN: ['1862-5355', '1862-5347']
DOI: https://doi.org/10.1007/s11634-021-00478-z